Generating Coherent Summaries of Scientific Articles Using Coherence Patterns

Authors

  • Daraksha Parveen
  • Mohsen Mesgar
  • Michael Strube
Abstract

Previous work on automatic summarization does not thoroughly consider coherence while generating the summary. We introduce a graph-based approach to summarize scientific articles. We employ coherence patterns to ensure that the generated summaries are coherent. The novelty of our model is twofold: we mine coherence patterns in a corpus of abstracts, and we propose a method to combine coherence, importance and non-redundancy to generate the summary. We optimize these factors simultaneously using Mixed Integer Programming. Our approach significantly outperforms baseline and state-of-the-art systems in terms of coherence (summary coherence assessment) and relevance (ROUGE scores).
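To make the idea of optimizing importance, non-redundancy and coherence jointly more concrete, here is a minimal sketch. It is not the authors' Mixed Integer Programming formulation: it replaces the solver with an exhaustive search over a toy objective, and every name and weight in it (`select_sentences`, `importance`, `redundancy`, `coherence`, `budget`) is a hypothetical stand-in introduced for illustration only.

```python
from itertools import combinations


def select_sentences(importance, lengths, redundancy, coherence, budget):
    """Exhaustively pick the sentence subset with the best combined score.

    All inputs are hypothetical illustration values, not from the paper:
    importance[i] : importance weight of sentence i
    lengths[i]    : length of sentence i in words
    redundancy    : {(i, j): penalty} applied when both i and j are selected (i < j)
    coherence     : {(i, j): reward} applied when both i and j are selected (i < j)
    budget        : maximum total length of the summary
    """
    n = len(importance)
    best_score, best_subset = float("-inf"), ()
    for k in range(n + 1):
        for subset in combinations(range(n), k):
            # Skip subsets that exceed the length budget.
            if sum(lengths[i] for i in subset) > budget:
                continue
            # Reward importance and pairwise coherence, penalize redundancy.
            score = sum(importance[i] for i in subset)
            for i, j in combinations(subset, 2):
                score += coherence.get((i, j), 0.0) - redundancy.get((i, j), 0.0)
            if score > best_score:
                best_score, best_subset = score, subset
    return best_subset, best_score


# Toy instance: four sentences of ten words each, room for two of them.
subset, score = select_sentences(
    importance=[3.0, 2.5, 2.0, 1.0],
    lengths=[10, 10, 10, 10],
    redundancy={(0, 1): 2.0},
    coherence={(0, 2): 1.0},
    budget=20,
)
print(subset, score)
```

In this toy instance the two most important sentences (0 and 1) are not chosen together, because their redundancy penalty outweighs the gain, while the pair (0, 2) benefits from a coherence reward. Exhaustive search only works for a handful of sentences; a real system would hand the same kind of objective to an integer programming solver.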

Similar resources

Surveyor: A System for Generating Coherent Survey Articles for Scientific Topics

We investigate the task of generating coherent survey articles for scientific topics. We introduce an extractive summarization algorithm that combines a content model with a discourse model to generate coherent and readable summaries of scientific topics using text from scientific articles relevant to the topic. Human evaluation on 15 topics in computational linguistics shows that our system pr...

Generating Coherent Summaries with Textual Aspects

Initiated by TAC 2010, aspect-guided summaries not only address specific user need, but also ameliorate content-level coherence by using aspect information. This paper presents a full-fledged system composed of three modules: finding sentence-level textual aspects, modeling aspect-based coherence with an HMM model, and selecting and ordering sentences with aspect information to generate cohere...

NLP Driven Models for Automatically Generating Survey Articles for Scientific Topics

This thesis presents new methods that use natural language processing (NLP) driven models for summarizing research in scientific fields. Given a topic query in the form of a text string, we present methods for finding research articles relevant to the topic as well as summarization algorithms that use lexical and discourse information present in the text of these articles to generate coherent a...

Generating Coherent Extracts of Single Documents Using Latent Semantic Analysis

Tristan Miller, Master of Science, Graduate Department of Computer Science, University of Toronto, 2003. A major problem with automatically-produced summaries in general, and extracts in particular, is that the output text often lacks textual coherence. Our goal is to improve the textual coherence of automatically produc...

Generating Single and Multi-Document Summaries with GISTEXTER

This paper presents the techniques implemented in GISTEXTER for producing extracts and abstracts from both single and multiple documents. These techniques promote the belief that highly coherent summaries may be generated when using textual information identified by the Information Extraction technology. The results of GISTEXTER in the DUC-2002 evaluations account for the advantages of using th...

Publication date: 2016